Robust Scatter Matrix Estimation for High Dimensional Distributions with Heavy Tails
نویسندگان
چکیده
This paper studies large scatter matrix estimation for heavy tailed distributions. The contributions of this paper are twofold. First, we propose and advocate to use a new distribution family, the pair-elliptical, for modeling the high dimensional data. The pair-elliptical is more flexible and easier to check the goodness of fit compared to the elliptical. Secondly, built on the pair-elliptical family, we advocate using quantile-based statistics for estimating the scatter matrix. For this, we provide a family of quantilebased statistics. They outperform the existing ones for better balancing the efficiency and robustness. In particular, we show that the propose estimators have comparable performance to the moment-based counterparts under the Gaussian assumption. The method is also tuning-free compared to Catoni’s M-estimator for covariance matrix estimation. We further apply the method to conduct a variety of statistical methods. The corresponding theoretical properties as well as numerical performances are provided. Keyword: Heavy-tailed distribution; Pair-Elliptical Distribution; Quantile-based statistics; Scatter matrix.
منابع مشابه
Robust Estimation of Transition Matrices in High Dimensional Heavy-tailed Vector Autoregressive Processes
Gaussian vector autoregressive (VAR) processes have been extensively studied in the literature. However, Gaussian assumptions are stringent for heavy-tailed time series that frequently arises in finance and economics. In this paper, we develop a unified framework for modeling and estimating heavy-tailed VAR processes. In particular, we generalize the Gaussian VAR model by an elliptical VAR mode...
متن کاملEvaluation and Application of the Gaussian-Log Gaussian Spatial Model for Robust Bayesian Prediction of Tehran Air Pollution Data
Air pollution is one of the major problems of Tehran metropolis. Regarding the fact that Tehran is surrounded by Alborz Mountains from three sides, the pollution due to the cars traffic and other polluting means causes the pollutants to be trapped in the city and have no exit without appropriate wind guff. Carbon monoxide (CO) is one of the most important sources of pollution in Tehran air. The...
متن کاملOn the Realized Risk of High-Dimensional Markowitz Portfolios
We study the realized risk of Markowitz portfolio computed using parameters estimated from data and generalizations to similar questions involving the out-of-sample risk in quadratic programs with linear equality constraints. We do so under the assumption that the data is generated according to an elliptical model, which allows us to study models where we have heavy-tails, tail dependence, and ...
متن کاملA Study of Skewed Heavy-tailed Distributions as Scale Mixtures
In this paper, we study and compare different proposals of heavy-tailed (possibly skewed) distributions as robust alternatives to the normal model. The density functions are all represented as scale mixtures which enables efficient Bayesian estimation via Markov chain Monte Carlo (MCMC) methods. However, while the symmetric versions of these distributions are able to model heavy tails they of c...
متن کاملOn linear models with long memory and heavy-tailed errors
AMS subject classifications: 62J05 60G18 60G51 Keywords: Bahadur representation Heavy tails Long memory M-estimation a b s t r a c t We consider the robust estimation of regression parameters in linear models with long memory and heavy-tailed errors. Asymptotic Bahadur-type representations of robust estimates are developed and their limiting distributions are obtained. It is shown that the limi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2015